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Abstract 

I provide operational postulates for quantum theory. These involve 
certain operational notions. Systems come in different types, a,b,c, .... 
A maximal set of distinguishable states is any set containing the maximum 
number, N 3 , of states for which there exists some measurement, called a 
maximal measurement, which can identify which state from the set we 
have in a single shot. A maximal effect is associated with each result of 
a maximal measurement. An informational face is the full set of states 
that give rise only to some subset of outcomes of some maximal measure- 
ments (it corresponds to constraining the system to have some reduced 
information carrying capacity). States are represented by vectors whose 
K 3 entries are probabilities. A set of states is said to be non-flat if it is a 
spanning subset of some informational face. A filter is a transformation 
that passes unchanged those states in a given informational face while 
blocking those states in the complement informational face (that would 
give rise only to outcomes in the complement outcome set of the maximal 
measurement). Classical probability theory and quantum theory are the 
only two theories consistent with the following set of postulates. 

PI Logical sharpness. There is a one-to-one map between pure states 
and maximal effects such that we get unit probability. 

P2 Information locality. A maximal measurement is effected on a com- 
posite system if we perform maximal measurements on each of the 
components (or, equivalently, iV a b = N a Nb). 

P3 Tomographic locality. The state of a composite system can be deter- 
mined from the statistics collected by making measurements on the 
components (or, equivalently, if a b = K s Kt). 

P4' Permutability. There exists a reversible transformation on any sys- 
tem effecting any given permutation of any given maximal set of 
distinguishable states for that system. 

P5 Sturdiness. Filters are non-flattening. 

To single out quantum theory we need only add any requirement that is 
inconsistent with classical probability theory and consistent with quantum 
theory. 
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1 Motivation 



The standard axioms of QT are rather ad hoc. Where does this structure come 
from? Can we write down natural axioms, principles, laws, or postulates from 
which can derive this structure? Compare with the Lorentz transformations 
and Einstein's two postulates for special relativity. Or compare with Kepler's 
Laws and Newton's Laws. The standard axioms of quantum theory look rather 
ad hoc like the Lorentz transformations or Kepler's laws. Can we find a natural 
set of postulates for quantum theory that are akin to Einstein's or Newton's 
laws? 

The real motivation for finding deeper postulates for quantum theory is that 
it may help us go beyond quantum theory to a theory of quantum gravity (just as 
Einstein's work helped him go beyond special relativity to his theory of General 
Relativity). It is in the finding of new physics that we can expect a real payoff 
of this program. 

In [29j I showed how classical probability theory and quantum theory are 
the only two theories consistent with the set of postulates given above in the 
abstract. In this chapter I will explain the meaning of these postulates and 
indicate how the main steps of the proof work. The reconstruction takes place 
in the context of the circuit framework which I will describe. 

2 A personal history of reconstruction 

A dozen years or so years ago Christopher Fuchs implored the community to 
"find an information theoretic reason" for axioms of QT (in multiple talks and 
a few papers [13 US]). Further, Chris Fuchs and Gilles Brassard invited me to 
a workshop in Montreal in 2000 on this issue amongst others (see notes in [16]). 
I accepted their invitation but was, in the event, unable to attend. However, I 
was already hooked. The work I began preparing for that workshop led to my 
paper "Quantum theory from five reasonable axioms" [22] . In modern form (see 
[28]), the axioms given there can be stated in the following way: 

Information Systems having, or constrained to have, a given information car- 
rying capacity have the same properties. 

Information locality Same as P2 above. 

Tomographic locality Same as P3 above. 

Continuous reversibility There exists a continuous reversible transformation 
between any pair of pure states. 

Simplicity States are specified by the smallest number of probabilities consis- 
tent with the other axioms. 

The simplicity axiom stands out as being less reasonable than the others. If we 
drop it then we may get a hierarchy of theories. This leads to two possibilities. 
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Either there do exist higher theories in this hierarchy or there do not. For many 
years I tried to find such theories, and I tried to prove that such theories do not 
exist. I also tried to find other reasonable axioms that rule out higher theories 
in this hierarchy. It was not until 2009 that progress was made by others. In 
2009 Chiribella, D'Ariano, and Perinotti (CDP) [7J showed how considerations 
concerning teleportation can be used to get rid of the need for a simplicity axiom 
in certain contexts. In 2010 [6] they found a set of postulates for quantum 
theory which, by virtue of the techniques developed in [7J, did not require a 
simplicity axiom. Independently Dakic and Brukner |10) showed how one can 
replace the simplicity axiom with the assumption that any state for a two level 
system can be written as a mixture of perfectly distinguishable states (modulo 
some technical problems in their proof arising from the unfortunate existence 
of a subgroup of SO (7) that is transitive on the 6-sphere). In 2010 Masanes 
and Miiller [35] showed how to replace the simplicity axiom with a different 
axiom saying that all mathematically well defined measurements for a two-level 
system are allowed. The axiom sets of Dakic and Brukner, and of Masanes 
and Miiller are slight modifications on my original axiom set from 2001. The 
axioms of CDP are quite different (except for the assumption of tomographic 
locality). Masanes, Miiller, Augusiak, and Perez-Garca provide another set of 
axioms employing tomographic locality, continuous reversibility, and another 
axiom concerning the existence of an informational unit. Another set of axioms 
using tomographic locality was given by Marco Zaopo [45] . 

There has been much work recently by many people along less related lines 
(Fuchs [17 , Goyal [19], Wilce [42], Rau [SH [39], Fivel [13], . . . ). Further, there 
is, in fact, a long history of attempts to reconstruct QT (von Neumann [4Tj . 
Mackey [33] , Birkoff and von Neumann [4] , Zierler [46] , Piron [37] Ludwig [32] , 
Rovelli [ID], and many others). 

Many of these reconstruction attempts employ the so called "convex proba- 
bilities framework" . This goes back to originally to Mackey and has been worked 
on (and sometimes rediscovered) by many others since including Ludwig [3"2"] . 
Davies and Lewis [IT], Gunson [3J, Mielnik [55], Araki g], Gudder et al. [2"U] . 
Foulis and Randall [U] , Fivel [T2] as well as more recent incarnations [121 [3J . 

The circuit framework used here [37] [55] (see also [2H1I2S]) might be regarded 
as a marriage of the convex probabilities framework and the pictorial (or cate- 
gorical) approach of Abramsky and Coecke [U [9] . A similar framework has been 
developed by Chiribella, D'Ariano, and Perinotti [7J. The pictorial approach of 
Abramsky and Coecke is important because of its emphasis on composition as 
a basic primitive. 

In 2002 Clifton, Bub, and Halvorson [8] were inspired by a suggestion of 
Fuchs and Brassard to take a different approach to reconstructing quantum 
theory. They showed that some features of quantum theory follow if one imposes 
no-bit-commitment, no-broadcasting, and no-signalling within the C* algebraic 
framework (rather than the convex probabilities framework). 
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3 The circuit framework 



In this section we will present the circuit framework. The basic idea is that 
circuits can be built from operations. An operation corresponds to one use 
of an apparatus with some particular outcome, or subset of possible outcomes 
specified. Operations have some number of systems as inputs and some num- 
ber of systems as outputs. We can wire together operations. If we have no 
open inputs or outputs left over then we have a circuit. We employ three back- 
ground assumptions for this framework. The main one is that we can associate 
a probability with a circuit (the joint probability that the outcomes are in the 
associated outcome sets on each operation). 

3.1 Operations 

We can notate an operation diagrammatically or symbolically as follows. 



The integer subscripts in the symbolic representation will be used to show where 
the wires go and have no significance beyond this. An operation, A, corresponds 
to one use of an apparatus and has the following features. 

• Inputs and outputs. Come in various types, a, b, The inputs cor- 
respond to wires going in the bottom of the box in the diagrammatic 
notation, or subscripts in the symbolic notation. The outputs correspond 
to wires coming out the top and to superscripts. 

• A setting, s(A). We can think of this as corresponding to certain positions 
for knobs, buttons, and dials that may be on the apparatus. 

• An outcome set, o(A). This is a subset of all the possible outcomes for 
this use of the apparatus. 

If xa G o(A) then we say operation A "happened" . If we have a different setting, 
or a different outcome set, then we have a different operation and should notate 
this with a different letter (e.g. B rather than A). 



b c 



A 




a b b 
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3.2 Wires 

Outputs can be connected to inputs by wires. 



d c 



B 



7 



b\ c 
I 



A 

m 

a a b 



fibiCs gd 7 c 8 
aia2b3 a6b4 



Note how the wire linking the two boxes corresponds to the repeated integer 
(the 4 on bn). These diagrams are interpreted graphically. In particular, vertical 
position has no meaning. We can distort the graph in any way we wish without 
changing the physical meaning so long as the wires remain attached to the same 
positions on the boxes and the boxes maintain their orientations. 
There are certain wiring rules. 

• One wire: At most one wire can be connected to any given input or output. 

• Type matching: Wires can connect inputs and outputs of the same type. 

• No closed loops: If we trace from output to input along wires through the 
operations then we cannot get back to the operation we started at. 

The last rule is to rule out closed time like loops. 
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3.3 Fragments 

The most general object we can consider is a collection of operations wired 
together. For example, 




Fragment E 



c a 



A cib 2 d 3 r>b 6 a7 /~a9Ciodn r>bi 3 ai4 
bi 5 ci6 c 4 a 5 ci ^-a7b2d3b 8 ci 2 a 9 cio 



Such objects are called fragments (as they are fragments of circuits). In general 
fragments may have open inputs and outputs. Fragments have 

• A setting, s(E), given by specifying the setting on each operation. 

• An outcome set, o(E), (equals o(A) x o(B) x o(C) x o(A) in this case). We 
say the fragment "happened" if the outcome is in the outcome set. 

• A wiring, w(E), given by specifying the input/output pairs which are wired 
together. 



3.4 Circuits 

Circuits have no open inputs or outputs. For example, 

Circuit H 




A aia2b3 B a ^C^D£ a4 E d5a2C6C7 



Circuits are special cases of fragments. Circuits have 



G 



• A setting, s(H), given by specifying the setting on each operation. 



• An outcome set, o(H), given by specifying the outcome set at each op- 
eration (equals o(A) x o(B) x o(C) x o(D) x oE in this case). We say the 
fragment "happened" if the outcome is in the outcome set. 

• A wiring, w(E), given by specifying which input/output pairs are wired 
together. 

3.5 Preparations, results, and transformations 

A preparation is a fragment having only outputs. Here are some examples: 



aba 



B 



a b 



c d 



We will associate states with preparations. 

A result is a fragment having only inputs. Here are some examples: 



C 



D 

TTX 

a c c 



75l 



we will associate effects with results. A measurement is a collection of results 
corresponding to the same setup with disjoint outcome sets whose union is the 
set of all outcomes for this setup. 

A transformation is a fragment that has inputs and outputs that is used 
in transformation mode. Here are some examples 



b b a 



B 

7~X 

a c 



A fragment is used in transformation mode if we do not feed outputs into inputs 
on this fragment (either directly or indirectly). 

A transformation, B^, is reversible if there exists another transformation, 
B^, such that B^B^ is the identity transformation. Note that the identity 
transformation is defined to have the property that, if it inserted on any wire 
in any circuit, then the probability for that circuit remains unchanged. 
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3.6 The first background assumption 

We need three background assumptions in setting up the circuit framework. 
The first is the following. 

Assump 1. We can associate a probability with any given circuit 
(the probability that the circuit "happens"), and this probability de- 
pends only on the specification of the given circuit (the knob settings 
and outcome sets at the operations, and the wiring). 

For example, 



Prob 




is well conditioned 



Note that we make this assumption for circuits, not for fragments generally. 
Indeed, a fragment with open inputs and/or outputs cannot be expected to 
satisfy this since the probability may depend on what is done with these open 
ports. 

3.7 The state 

Want to associate a state with a preparation 



B 



a b 



c d 



There exist many results which complete this into a circuit. Here are a few 
examples: 



a b 



c d 
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The state associated with a preparation should enable us to predict the proba- 
bility for every circuit containing this preparation. 

We could specify the state associated with a preparation by giving a list 
of probabilities for every circuit made with this preparation. This would be a 
very long list and rather cumbersome to work with. However, physical theories 
typically relate different quantities. Consequently it should be possible to pick 
out a subset results such that specifying the probabilities for just the circuits 
containing the given preparation and results from this subset is sufficient to allow 
us to calculate the probability for any other circuit containing this preparation. 
For example, in Quantum Theory we can calculate all the probabilities for a 
spin-half particle from just the probabilities 

/ Px+ \ 

Py+ 

Pz + 

\P- J 

as these suffice to determine the density matrix for this system. In fact, these 
probabilities suffice to determine the elements of the density matrix by linear 
equations. We will insist on linearity in what follows. This is well justified when 
one considers taking mixtures of states (see Appendix B of [33] for example). 
We call the choice of results used to specify the state fiducial results. In 
general this choice is not unique. 

3.8 Using Fiducial results to define states 

It is worth paying attention to the font used in the notation below. Consider 
preparations of the form A ai . We choose a fiducial set of results 

X^ 1 for 01 = 1 to K 3 

The state associated with preparation A ai is given by 

A ai := Prob(A ai X^) 

A fiducial set is a minimal set such that, for any result, B ai there exists an 
effect B ai such that 

Prob(A ai B ai ) = A ai B ai (summation over a% implied) 

This is linear. If we allow arbitrary mixtures then must have linearity here |29j . 
However, even if we do not allow arbitrary mixtures, we are free to consider 
only linear relations of this type even though there may be a more efficient non- 
linear expression. Associated with preparation A ai is the state, A ai . This 
is a list of K a fiducial probabilities from which all other probabilities can be 
calculated. Associated with the result B ai is the effect B ai . This is a list of 
K 3 real coefficients (which can be negative). 
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3.9 Pure states 



A mixed state is one that can be simulated by a mixture of preparations. I.e. 

where < A < 1 and B ai ^ C ai . A pure state is one that cannot be simulated 
by a mixture of preparations. A transformation is non-mixing if it preserves 
purity (up to normalization). 



3.10 Maximal sets 

A very important notion is that of a maximal set. 

A maximal set of distinguishable states is any set containing 
the maximum number, N a , of states for which there exists some 
measurement, called a maximal measurement, which can identify 
which state from the set we have in a single shot. 

We also need the following notion. 

A maximal effect is associated with each outcome of a maximal 
measurement. 

We can notate these notions diagrammatically as 



Prob 



\ | A N / 



or symbolically as 



A^[n]B ai [m}=5 n 



where m, n = 1 to N a . 

In quantum theory maximal sets of distinguishable states are associated with 
an orthonormal basis. Then N a is the dimension of the Hilbert space, maximal 
measurements correspond to non-degenerate observables, and maximal effects 
correspond to rank-one projectors. 

In classical probability theory there is a unique maximal set of distinguish- 
able states and it is usually understood to correspond to the underlying states 
of reality. 



3.11 Two more assumptions for framework 

We need two more background assumptions for the circuit framework. 

Assump 2. There exists at least one type of system having N 3 > 1 
and K 3 finite. 
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Recall that N a is the maximum number of states in a distinguishable set and 
K 3 is the number of probabilities that must be provided to specify the state. In 
quantum theory we note that systems having finite N a also have finite K a . 
We will give the third assumption without defining all the terms. 

Assump 3. If, for any accuracy 6 > 0, there exists a fragment A[S] 
that is operationally indiscernible from a given hypothetical frag- 
ment, Q, then there actually exists a fragment with the probabilistic 
properties of Q. 

This assumption is used to obtain the property that the space of fragments is 
compact in an appropriate sense. The reader is referred to [29] for more details. 

3.12 Permutation transformations 

We can define permutation transformations with respect to a given maximal set 
of distinguishable states 



A[7r(n)] 



for some permutation tt. That is, a permutation transformation permutes the 
elements of a maximal set of distinguishable states. 



3.13 PI 

For the purpose of clarity, it is worth discussing the first postulate at this stage. 

PI Logical Sharpness. There is a one-to-one map between pure states and 
maximal effects such that we get unit probability. 

This means that for any given pure state there is a unique maximal effect giving 
unit probability, and that for any given maximal effect there is a unique pure 
state giving unit probability. In pictures, there is a one-to-one map between 
pure states and maximal effects: 



such that 



Prob 



U 



Interestingly, causality follows from this postulate. This is the property that 
choices in the future do not influence probabilities the past. The causality 
property was introduced by CDP as corresponding to the existence of a unique 
deterministic effect [7\ and used as a postulate in their reconstruction [6]. 
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3.14 Informational faces and non-flat sets of states 



An informational face, S, is the full set of states having support only on 
some subset, O5, of the outcomes of some maximal measurement, {B ai [m]}. 
Basically, these are sets of states constrained to have a certain information 
carrying capacity. The states in S have support on the complement subset 
of outcomes, O5, for the same maximal measurement. In convex geometry a 
face is given by the intersection of the convex set in question and a supporting 
hyperplane. A supporting hyperplane is one which has no elements of the convex 
set on one side. The supporting hyperplane defining S is given by the equation 



Faces are, themselves, convex sets. In quantum theory all faces are, in fact, 
informational faces by virtue of the spectrality property (any state can be written 
as a convex combination of states in a maximal distinguishable set). However, 
this need not be the case and we do not assume this here. 

A set of states is non-flat if it is a spanning subset of some informational 
face. It could be an over-complete spanning subset and consequently the infor- 
mational face is, itself, non-flat. If PI holds then we can think of a non-flat 
set of states as a kind of generalization of the notion of a pure state. In fact it 
follows from PI that any single member non-flat set of states consists of a state 
proportional to a pure state. Thus, we can think of a set containing a single 
pure state as being the simplest type of non-flat set in a hierarchy of bigger and 
bigger non-flat sets. 

We need the following notion to understand P5. 

A transformation is said to be non-flattening if, for any non-flat 
set of states we send in, we get a non-flat set of states out. 

It follows from PI that all non-flattening transformations are also non- mixing. 
Interestingly, in quantum theory the converse is true also: all non-mixing trans- 
formations are non-flattening. 




(1) 



3.15 Filters 



A filter, F, is defined with respect to a given informational face, S. 



A filter is a transformation that 



• passes unchanged states in S 



• blocks states in S 
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For a filter defined with respect to an informational face S given by maximal 
measurement {B[m]} and outcome set Os we have 

a 




= for m £ O s 



a 




= for m £ O s 



Here 

a 





is the preparation corresponding to the null state. The null state is the state 
that gives probability zero for any circuit it is part of. The components of the 
null state are, therefore, all equal to zero. 

4 THE POSTULATES 

Classical probability theory and quantum theory are only two theories consistent 
with the following postulates. 

PI Logical sharpness. There is a one-to-one map between pure states and 
maximal effects such that we get unit probability. 

P2 Information locality. A maximal measurement on a composite system is 
effected if we perform maximal measurements on each of the components. 
Equivalently N 3b = N 3 N b . 

P3 Tomographic locality. The state of a composite system can be determined 
from the statistics collected by making measurements on the components. 
Equivalently K 3b — K 3 K b . 

P4' Permutability. There exists a reversible transformation on any system ef- 
fecting any given permutation of any given maximal set of distinguishable 
states for that system. 

P5 Sturdiness. Filters non-flattening. 
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4.1 Ruling out the classical case 



To single out quantum theory it suffices to add anything that is inconsistent with 
classical probability and consistent with quantum theory. The key property of 
non-classical theories is that K 3 > N 3 for non-trivial systems (i.e. systems having 
N 3 > 1). One way to ensure this is to replace P4' with 

P4 Compound permutability. There exists a compound reversible transforma- 
tion on any system effecting any given permutation of any given maximal 
set of distinguishable states for that system. 

A compound transformation is one that can be made from two sequential trans- 
formations (neither equal to the identity). The advantage of this is that it 
requires only adding a single word (the word "compound") to one of the ex- 
isting postulates. However, as we just mentioned, we could add any property 
inconsistent with classical probability theory so long as it is consistent with 
quantum theory. For example, we could simply demand that there are more 
pure states than there are states in any maximal distinguishable set of states 
for non-trivial systems. 



4.2 P2 



Our second postulate is the following. 

P2 Information locality. A maximal measurement on a composite system is 
effected if we perform maximal measurements on each of the components. 

This means the set of results (with m = 1 to N a and n = 1 to Nb) 



A fa 



is a maximal measurement. 

If AT a is the maximum number of distinguishable states then P2 is equivalent 
to statement that 

N ah = N a N h 

This is very natural. For example, if we have a die (N a = 6) and a coin (Ab = 2) 
then we have N a b = 12. We call this "information locality" since the total 
information capacity is given by adding together the local information capacities: 

logAT ab = logAr a + logAT b 

This postulate is looks innocent but it is actually very powerful. Certainly we 
can imagine situations in which this postulate is not true (see |31j). 
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4.3 P3 



Our third postulate is used in many of the recent reconstructions of quantum 
theory. It can be stated in the following way. 

P3 Tomographic locality. The state of a composite system can be determined 
from the statistics collected by making measurements on the components. 

Pictorially this means we can determine the state associated with the prepara- 
tion, A aib2 , by deter mining the probabilities for circuits of the form 



X 



Y 



It follows from this that we can write the state as A aib2 where this is a list 
of joint probabilities determined by putting separate fiducial results on system 
ai and b2- In fact, more generally, it follows from tomographic locality that 
we can represent an arbitrary operation such as by a tensor . 

Actually, this fact is an equivalent statement of tomographic locality. In words 
the equivalent statement is that an arbitrary operation can be fully characterized 
by local process tomography. Then the probability for a circuit is given the 
scalar obtained by contracting over indices where there are wires in the circuit. 
For example, 

Prob(A^ b ^3f 6B ^ 5 2C3 C d4 D e5f6 ) - A a ^f°B d a f b l c C di D e5h (2) 

In [27J a tensor such as correspond to putting a more general object 

called a duotensor into standard form. Duotensors play an important role in 
the full reconstruction. However, we will not discuss them further here. 
Another equivalent statement of tomographic locality is that 

K ab = K 3 K b 

(where K a is number of probabilities required to specify state). Hence we see 
that information locality and tomographic locality are very similar postulates 
(they were grouped together in [22]). 

There exist other equivalent statements of the tomographic locality assump- 
tion (see for some of them). 



4.4 P4' 

The forth postulate concerns the ability to permute the states in a maximal set 
of distinguishable states by means of a reversible transformation. 

P4' Permutability. There exists a reversible transformation on any system ef- 
fecting any given permutation of any given maximal set of distinguishable 
states for that system. 
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In pictures we can say that exists reversible P n 

a 



A[7r(n)] 



for any maximal set of distinguishable states, A ai [n] and permutation, it. Re- 
versibility means that this transformation is reversible when applied to any state 
(not just the members of the maximal set of distinguishable states). 

This postulate implies we can perform lossless arbitrary translation of a 
message encoded with respect to any alphabet to one encoded with respect to 
any permutation of this same alphabet. 



4.5 P5 

The last postulate concerns filters. 

P5 Sturdiness. Filters are non-flattening. 

Recall that a set of states is said to be non-flat if it is a spanning subset 
some informational face. One way to think of this property is that sets of 
states resist being squashed (hence the name "sturdiness"). Quantum states 
are not as sensitive as we might have imagined. A filter is a pretty dramatic 
transformation. However, according to this postulates, sets of states remain as 
intact as they can under the circumstances. 



5 Outline of reconstruction 



The full reconstruction, while only using elementary mathematics, is rather 
lengthy. Here we will only give an outline of some of the main steps. 
First, using PI, P2, P3, and P4' we 

• show that there exists a reversible transformation between any pair of pure 
states, 

• construct arbitrary filters, 

• show there exist types with N — 1,2,3,..., 

• show that systems having same A^ are equivalent, 

• show that K 3 = N 3 where r = 1, 2, 3, . . . (the Wootters hierarchy [43"ll44]. 
Using P5 as well we 

• show that gebits (generalized bits, i.e. systems having N 3 — 2) correspond 
to hyperspheres, 
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• show that all points on the hypersphere correspond to pure states, 

• show how to do teleportation, 

• prove that K a = N a or K a = 7V a 2 . 

This gives us the bit or the qubit. Interestingly, getting the qubit is the most 
difficult part of this reconstruction as well as many others. Having got the 
qubit we get the appropriate constraints on quantum theory in general (not just 
the N a = 2 case) by showing that a certain "magic operation" can implement 
any complete set of superoperators (superoperators corresponding to a set of 
operations associated with a given apparatus with disjoint outcome sets where 
the union of these outcome sets is the full set of outcomes) . To complete this last 
step and get quantum theory in general we employ the duotensor framework [27] 
and the operator tensor formulation of quantum theory [29, 30]. We will provide 
an outline of how some aspects of the reconstruction work in the following 
subsections. For the full details of the proofs (which are mostly omitted here) 
the reader is referred to [2"9"] . 

5.1 Reversible transformation between pure states 

Let 

{U ai [n] : n = 1 to N a } and {V ai [n] : n = 1 to N a } 
be maximal sets of distinguishable preparations for a. Let 

{W b2 [m] : m = 1 to N b } 

be a maximal set of distinguishable preparations for b. We will denote the 
maximal measurement that distinguishes these maximal sets of distinguishable 
states by {U ai [n]}, {V a >]}, and {W b2 [m]}. 
It follows from P2 that 

{U ai [?i]W b2 H : nm = ll,12,...iV a iVb} 

is a maximal set for ab. Similarly, 

{V ai [?i]W b2 [m] : ?w7i= H,12,...iV a iV b } 

is another maximal set for ab. 

Let P be the reversible transformation that permutes U a ' [7i]W b2 [m] according 

to 

7Tp = (TITO <r-> 771 71 ) 

Let Q be the reversible transformation that permutes V ai [71] W b2 [771] according 
to 

ttq = (titti <-> mn) 
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Note we choose b such that Nb = N a . Then a little thought shows that 



a W[l] 

Q 

ye 

b 

p 

a W[l] 



does the job. We can prove that this is a reversible transformation and it clearly 
takes U ai [1] to V 33 [1] . In fact it actually does a bit more. It takes U ai [n] to V 33 [n] 
for n = 1 to iV a . 

5.2 Arbitrary filters are possible 

It can be shown that the transformation 



a T 

P 

b 

p 

a [Vgj 

(3) 

effects an arbitrary filter where iV a = ./Vb and ni is any integer chosen from Os- 
Here the transformation P is a permutation transformation with permutation 

( nm o ran if n and m £ 0(5*) \ . . 

y nm <-> nm otherwise J ^ ' 
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and P is the transformation that reverses P. The result Tb is a deterministic 
result (its outcome set is equal to the set of all outcomes). 

5.3 Systems with same are N equivalent 

These substitutions prove equivalence when N a — N b . 



T 



T b 



u[i; 



u[i; 



V[l] 



(5) 



v[i; 

T 



V[l] 



U[l] b 



where 



7r = (nm -H- ran) 



(6) 



With these substitutions we can replace any wire of type a by one of type b 
(and vice versa) without changing the probability for the given circuit. 

5.4 Proof that K 3 = iV a r 

It follows from the first four postulates that 

• K a = K(N 3 ) (since systems having the same N a are equivalent). 

• K(N + 1) > K(N) (since we can filter systems down). 

• K(N a N b ) = K(N a )K(N b ) (by P2). 
It can be shown that 



K a =Nl where r = 1,2,3,... 

follows (the proof of this uses the decomposition of N a and N b into prime num- 
bers). This relationship was first suggested by Wootters [131111] and hence we 
term it the Wootters hierarchy. It was first proven that this relationship follows 
from the above more basic premises in [35] 
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5.5 All points on hypersphere correspond to pure states 



It can be shown to follow from fact that there exists a reversible group of trans- 
formations between pure states that all pure states must live on surface of a 
hypersphere. We then need to show that all points on this hypersphere corre- 
spond to pure states for the gebit. The proof of this starts with a getrit (a system 
having N a — 3). We prepare a system constrained to an N a = 2 informational 
face of this getrit (i.e. we consider states contrained to a gebit space). Next we 
filter on the getrit space but with respect to a maximal measurement that has 
one maximal result having full support on the afore mentioned gebit and one 
maximal result having only partial support on this gebit. What happens is that 
the states emerging out of this filter are also gebit states but they move closer to 
one of the poles of the gebit (the one associated with the maximal result having 
full support). If we keep filtering like this then an intially non-flat set of states 
will, by P5 remain non-flat but will move closer to this pole. We can produce 
a spanning set of states that are as close to the pole as we like. It then follows 
that within an infintessimal region of the pole there must be points lying in any 
direction. Since any state could serve as the pole, this proves that all points on 
the hypersphere are populated. 



The classical case corresponds to the 1-sphere with just two pure states. If we 
are in the non-classical case then we want to prove that this hypersphere must 
be the 2-sphere corresponding to the qubit of quantum theory. Consequently, 
we want to prove that for N a = 2 we have K a = 4 (in the non-classical case). 
This means that we get the Bloch ball (since one parameter counted in K 3 
corresponds to normalization). This proof is adopted from a beautiful proof 
due to Chiribella, D'Ariano, and Perinotti [7j 16] using teleportation. 

We start by assuming that we are in the non-classical case. Consider the 
gebit preparation 



where B ai is a gebit preparation, E a2Cla is an entangled pure state in Sm^} and 
M aia2 is a certain maximal entangled effect (these do not exist in the classical 
case but must exist in the non-classical case). We can show that the transfor- 
mation on B ai is non- flattening using P5. The pure states for preparation B Ql 
lie on the surface of a hypersphere. Under the transformation in ([7]) this hyper- 
sphere is transformed to an hyper-ellipsoid. Hence we can use the preparation in 
([7]) to prepare a state proportional to any pure state by making an appropriate 
choice of preparation B. The state that is prepared is not necessarily equal to 
B ai so we do not necessarily have faithful teleportation. However, we can use 



5.6 Getting the qubit and K a = N% 



M 



(7) 
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this result to obtain the following result. 




(8) 



for any state A ai for the gebit. Now we do have faithful (probabilistic) telepor- 
tation. We work in a computational basis for the gebit denoted by and 1. Here 
B a2 is a special choice of state. In fact it must be an equatorial state - a pure 
state on the equator of the hypersphere between the two poles (these exist only 
in the non-classical case). Also, P cno t is a reversible permutation transformation 
effecting the permutation associated with the CNOT gate in the computational 
basis. 

Since we now have faithful (albeit probabilistic) teleportation, we have 



M 


l 


1 1 1 


B 


fcnot 





We can also prove that 



Prob 



< 



using the fact that B a2 is equatorial. For convenience we put 



(9) 



M 



N 


:= 


1 


R;not 











(10) 



Then 



at M a2aa — -T a3 



(11) 
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where 1^ is the identity. Hence, 

N aiaa M™ = h«l = ±K 3 (12) 

since the trace of the identity is equal to the dimension of the space on which 
it acts. But we also have 

N aia2 M a ^ < 1 (13) 
It follows that, for a gebit, K 3 < 4. Hence, K — N 2 in general. 



5.7 The magic operation 

The last part of the proof shows how to use the fact that the gebit is equal to the 
qubit along with the postulates to get quantum theory in general. The key part 
of this is showing that the following set of operations (for different outcomes / 
of a maximal measurement) 



V[l] 



I T 



1] 



Q 



(14) 



1 



can generate any complete set of operations in quantum theory. Here P and 
Q are appropriately chosen reversible permutation transformations, {(f>[n] : n = 
1 to iV a } are appropriately chosen phases, c and d are ancillary systems having 
appropriate iV c and A^, and V b2C3d4 [l] is an appropriately chosen preparation 
(for a pure state). The unlabeled wires represent qubits. T is the deterministic 
effect. This proof employs the duotensor and operator tensor frameworks and 
the reader is referred to [221 for details. 
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6 Conclusions 



I have provided a set of operational postulates from which quantum theory can 
be reconstructed. This does not require a simplicity assumption as did my 
earlier work |22] from over a decade ago. This is one of a number of recent 
reconstructions jTUl IE1 ISS1 [Ml EH] along similar lines which use the assumption 
of tomographic locality (like [35]) and do not need a simplicity assumption. 
There are strong connections between these different approaches and many of 
the proof techniques are similar. What appears as a postulate in one approach 
appears as a low level theorem in another and vice versa. One might think of 
a set of postulates as being a little akin to a choice of coordinate system used 
to represent some shape. If we find a good coordinate system then the shape 
appears simple. The fact that there are a number of good postulate sets that 
are fairly simply related to each other is similar to the fact that there are often a 
number of good choices of simply related coordinate system for viewing a shape. 
While one may have preferences for one or the other set of postulates, there is 
not really much to distinguish them. 

However, I am left with the sense that some much deeper insights are still left 
to be had. One reason for this sense is that in the operator tensor formulation 
[291 130] of quantum theory (and, similarly, in the quantum combs approach 
[5J preparations, transformations, and results are all treated on a fairly equal 
footing. However, this is not true of the postulate set presented here or the 
others I have mentioned. Surely the postulates should also treat all kinds of 
operations on an equal footing (so far as this is possible). Further motivation 
for this comes from quantum gravity. We do not yet have a theory of quantum 
gravity. However, when we do, it seems likely that we will have to contend 
with indefinite causal structure. We will not be able to say whether some 
particular interval is space-like, time-like, or null, but rather can expect to 
have something like a quantum superposition of these different cases. Then 
we cannot be sure that some ports on an operation are inputs and others are 
outputs (since these notions assume definite causal structure). In this case 
we cannot distinguish preparations, transformations, and results. Quantum 
theory might reasonably be expected to be obtained as a limit of quantum 
gravity (in the limit as we have definite causal structure). In this limit distinct 
notions of preparations, transformations, and results might emerge. However, 
fundamentally (before the limit is taken), they are not distinct. It would, then, 
be great if a set of postulates for quantum theory treated them more-or-less 
on an equal footing. Some of these postulates may then go over to a theory 
of quantum gravity. In [331 [Ml 12S] I developed the "causaloid framework" for 
general probabilistic theories that can accommodate indefinite causal structure. 
One can put quantum theory into this framework and it does, indeed, have 
the feature that preparations, transformations, and results are then treated on 
a (more-or-less) equal footing (indeed, the causaloid formulation of quantum 
theory is the origin of the operator tensor formulation) . 

I think that the real test of this research program will be the progress that 
is made towards a theory of quantum gravity using these newly developed tech- 
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niques. It is in constructing new physical theories that we can really test whether 
we are on the right path since then we have to make new predictions and account 
for new experimental data. Furthermore, the more fundamental our physical 
theory, the more natural we can expect our postulates to be. 
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